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Abstract. For a random quantum state on H = C d C d obtained by partial tracing a random 
pure state on H <£) C s , we consider the question whether it is typically separable or typically 
entangled. For this problem, we show the existence of a sharp threshold sq = so{d) of order 
roughly d 3 . More precisely, for any e > and for d large enough, such a random state is entangled 
with very large probability when s ^ (1 — e)sq, and separable with very large probability when 
s ^ (l+e)so- One consequence of this result is as follows: for a system of ./V identical particles in a 
random pure state, there is a threshold fco = ko(N) ~ N/5 such that two subsystems of k particles 
each typically share entanglement if k > fco, an d typically do not share entanglement if fc < fco. 
Our methods work also for multipartite systems and for "unbalanced" systems such as C dl (£> C d2 , 
di 7^ di- The arguments rely on random matrices, classical convexity, high-dimensional probability 
and geometry of Banach spaces; some of the auxiliary results may be of reference value. 



1. Introduction 

In recent years, random constructions have become a very fruitful tool in quantum information 
theory The study of random channels and random states has particularly intensified since the 
influential paper [21]. The most successful achievement of the probabilistic method in quantum 
information theory is arguably Hastings's proof that suitably chosen random channels provide a 
counterexample to the additivity conjecture for classical capacity of quantum channels [20] . 

In this paper, we address the most fundamental question one can ask about a random state: 
is it entangled? Detecting and exploiting entanglement, first discovered in the 1930's p3], is a 
central problem in quantum information and quantum computation at least since Shor's work 
[55] on integer factoring. However, the structure of the set of entangled quantum states is still 
not well-understood. The well-known Peres-Horodecki positive partial transpose (PPT) criterion 
[22j[33] is a necessary condition for (lack of) entanglement, but this condition is sufficient only for 
qubit-qubit and qubit-qutrit systems [39| H6]. 

We consider here a family of random states that are known as random induced states. These 
are mixed states on % obtained after partial tracing, over some ancilla space "H a , a uniformly 
distributed pure state on % ® T-i a - This leads to a natural family of probability measures on the 
set of states on % (see [SB IFJ), where dim"H a , the dimension of the environment, is a parameter. 
Indeed, if all that we know about the system % ®% a are the dimensions of the factors % and % ai 
and that it is isolated from the rest of the environment, the corresponding random induced state 
is a reasonable model for, or at least a reasonable first guess about the state of the system %. 
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Of course, the induced state p being random, we cannot expect to be able to tell what p is. 
However, we may be able to infer some properties of p if they are generic (that is, occur with 
probability close to 1) for a given random model. For specificity, consider T-L = C d ® C d and let 
us focus on the "Is the random induced state entangled?" question. As it turns out, the answer 
depends in a crucial and rather precise way on the size of the environment with low- dimensional 
environments leading typically to entangled states and high-dimensional environments leading to 
separable states. 

In the special case dimI-L a = dim'H = d 2 , we are led to the uniform distribution on the set 
of states (i.e., uniform with respect to the Hilbert-Schmidt volume). As was shown in [3J, the 
proportion of states (measured with respect to the Hilbert-Schmidt volume) that are separable is 
extremely small in large dimensions. This means that when dim"H a = d 2 , random induced states 
are typically entangled. This was extended to the case when dimI-L a is slightly larger than d 2 in 
US [2]. 

On the other hand, it was proved in [21] that random induced states on C d (g) C d are typically 
separable when dimH a is proportional to d 4 . In the present paper, we bridge the gap between 
these estimates and show that the threshold so(d) between separability and entanglement occurs 
at order d 3 (more precisely, we obtain the inequalities cd 3 ^ s (d) ^ Cd 3 log 2 d, where C, c > are 
universal constants, independent of the dimensions involved). More specifically, we show that for 
any e > 0, the following holds if d is large enough. When the environment dimension s = dim"H a 
is smaller than (1 — e)sq, the random induced state is entangled with overwhelming probability. 
When the environment dimension exceeds (1 + s)sq, the random induced state is separable with 
overwhelming probability. 

The heuristics behind the consequence stated in the abstract is now as follows. If we have a 
system of N particles (with D levels each) which is in a random pure state, and two subsystems of 
k particles each, then the "joint state" of the subsystems is modeled by a random induced state on 
C d <g)C d with d = D k and s = D N ~ 2k . In particular, the relation k = N/5, or N = 5k, corresponds 
exactly to s = d 3 . The reason for the threshold effect is that passing from k to k — 1 increases 
s by a factor of D 2 ^ 4, while - as stated above - the transition from generic entanglement to 
generic separability takes place when s is increased only slightly (the simultaneous decrease of d 
by a factor of D ^ 2 only amplifies the effect). 

Our method of proof is geometric and uses tools from high-dimensional convexity, a field also 
known as asymptotic geometric analysis. This has become a fruitful approach to study the geome- 
try of quantum states in large dimension, and recent contributions include, for instance, [H |4"5| IT4"] . 
Asymptotic geometric analysis strives to understand properties of geometric structures in high di- 
mension, which - because of the "central limit theorem-like" effects - are believed to be relatively 
easier to pinpoint than those in low dimension. The phenomenon of concentration of measure 
plays a central role in the whole theory. 

While our starting point is an estimation of the Hilbert-Schmidt volume of the set S of separable 
states on C d ®C d , the relevant geometric parameter to estimate is not the volume itself, but rather 
the mean width of the dual set S° . We estimate it in an indirect way, using the MM*-estimate, a 
general result from asymptotic geometric analysis. 
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Our method also applies to the case of multipartite systems (C d )® fc . In the asymptotic limit, 
when k is fixed and d goes to infinity, we show that the threshold for separability vs. entanglement 
occurs for s of order d 2k ~ l (up to logarithmic factors). However, to keep the exposition as simple 
as possible, we focus on the bipartite case k — 2 in most of the paper. 

Finally, we also show that "one half" of our main result (entanglement is generic for small 
dim %a) can be proved in a more elementary way by working with the density of induced measures. 
Similar arguments appeared already in the papers [UJ, 08] . 

This paper is organized as follows. In Section [21 we introduce some mathematical background, 
necessary notation and state our main theorem. Sections [3] and H] contain the proof of the main 
theorem. In Section [51 we extend our results to the case of multipartite systems. In Section [61 we 
will provide another, more elementary, proof of our result, which is valid only in the regime where 
entanglement is generic. Miscellaneous remarks and loose ends are addressed in Section [7J The 
mathematical tools we need are collected and/or developed in several Appendices, some of which 
may be of independent interest and/or reference value. General references for concepts related to 
quantum information theory are [5] and [32], for those related to asymptotic geometric analysis 
[3l], and for those related to random matrices [1] or [T2]. A high-level non-technical overview of 
the results of this paper and of a related article [2] can be found in [5] . 

2. Notation, background and the statement of the main theorem 

The letters C,c,cq, ... denote absolute numerical constants (independent of the instance of the 
problem) whose value may change from place to place. When A, B are quantities depending on 
the dimension (and perhaps some other parameters), the notation A < B means that there exists 
an absolute constant C > such that the inequality A ^ CB holds in every dimension. Similarly 
A ~ B means both A < B and B < A. As usual, A ~ B means that A/B — > 1 as the dimension 
(or some other relevant parameter) tends to oo, while A = o(B) means that A/B — >■ 0. 

There are various dimensions appearing repeatedly in this paper, and we will stick to the fol- 
lowing notational scheme. We will work in a complex Hilbert space T-L = C", with n = d k , so that 
we may identify C n with (C d )® fc (k = 2 in most of the paper). We will always assume that d ^ 2. 
The set of states on C n has real dimension m = n 2 — 1. We also consider an ancilla space C s , and 
pure states in C n ® C s . Occasionally the unit sphere in C™ ® C s will be identified with the unit 
sphere 5 m_1 C R m , with m = 2ns. General results from convex geometry will also be stated in 
R m (except in Appendices [B] and O, where we consider R dimW , and so R n is more appropriate). 

2.1. Basic facts from convex geometry. A convex body K in a real finite-dimensional vector 
space (usually identified with R m ) is a convex compact set with non-empty interior. We denote 
by | ■ | the Euclidean norm. A convex body K is symmetric if K = —K. The convex bodies we 
consider may be non-symmetric, but we usually "arrange" that the origin belongs to the interior 
of K. The gauge associated to such K is the function || • \\k defined for x G R m by 

\\x\\ K := inf{t )0 : iG tK}. 



— x\\k when K is non-symmetric. 
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By vol we denote the Lebesgue measure on R m . The volume radius of a convex body K (zH r 
is defined as 



vrad(iT) := 



vol B% 



where B™ denotes the unit Euclidean ball. In words, vrad(i^) is the radius of the Euclidean ball 
with same volume as K. The same notation will be used if K "lives" in an m- dimensional linear 
or affine subspace of a larger space. 

If K C R m is a convex body with origin in the interior, the polar of K is the convex body K° 
defined as 

K° := {y G R m : (x, y) < 1 Va: G K}. 

The inradius of a convex body K is the largest radius r of a Euclidean ball contained in K. 
Similarly, the outradius of K is the smallest R > such that K is contained in a ball of radius R. 
In most cases the optimal balls will be centered at the origin, then r and R can be equivalently 
defined as the "best" constants for which R~ x \ ■ | ^ || • ||# ^ r~ l \ • |. 

If u is a vector from the unit sphere S m ~ l , the support function of K in the direction u is 
}ik{u) = max x& K(x, u) = \\u\\k°- Note that hx{u) is the distance from the origin to the hyperplane 
tangent to K in the direction u. The mean widthQ of K is then defined as 

(1) w(K) := / }ik(u) da(u) = / ||M||^-o(icr(ti), 

where da(u) is the normalized spherical measure on the sphere S 1 " 1-1 (this definition makes in fact 
sense for any bounded set K). 

The Urysohn's inequality (see, e.g., [31]) is a fundamental result which compares the volume 
radius and the mean width: for any convex body K C R m , we have 

(2) vrad(A') ^ w{K). 

It is often convenient to consider the Gaussian variant of mean width 

w G {K) :=E||G||*o, 

where G is a standard Gaussian vector in R m , i.e., a random vector with independent iV(0, 1) 
coordinates in any orthonormal basis. One checks, by passing to polar coordinates, that for every 
convex body K C R m 

(3) w G (K)= 7m w(K), 
where 

^.^i V2T((m + 1)/2) , r - 

1 (m/2) 

is a constant depending only on m. 



1 It would have been geometrically more precise to call this quantity the mean half-width. 
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Note that we may extend the definition of the Gaussian mean width to all bounded sets K C R m 
through the formula 

(5) w G (K) = Esup(G,x). 

The Gaussian mean width of K is an intrinsic parameter and does not depend on the ambient 
dimension: if K lives in a subspace E C R m , the formula ([5]) gives the same value whether G is a 
standard Gaussian vector in R m , or a standard Gaussian vector in E. 

2.2. Quantum states. Throughout the paper, we consider a (finite dimensional) complex Hilbert 
space "H, equipped with a norm which we will also denote by | • |. 

A quantum state on % is a positive trace one operator on "H. We use T> = T>(H) to denote the 
set of all quantum states on "H. The extreme points of this set are pure states, in particular if 
n = C n , then 

X?(C n ) =conv{|^)(^| : tp E C™, \ip\ = 1}. 

Above (and, when convenient, in what follows) we use Dirac's bra-ket notation: is a column 
vector, = is a row vector and is their outer product, the orthogonal projection onto 

O0. The set V{C n ) is contained in the (real) space Ai™ of n x n self-adjoint matrices, endowed 
with the Hilbert-Schmidt inner product (A, B) = tr(AB). Whenever considering a geometric 
invariant (inradius, mean width, . . . ) of a set of matrices, it will be tacitly understood that the 
Hilbert-Schmidt Euclidean structure is used. This applies also to spaces of not-necessarily-self- 
adjoint and/or rectangular matrices; in that case (A, B) = tr(AB^). We will also occasionally use 
the Schatten p-norm || A || p = (tr(AU)P/ 2 ) 1/p . The limit case || • | loo coincides with the operator 
norm || • || op (from the category of normed spaces), while || • H2 = || • \\hs is the Hilbert-Schmidt (or 
Frobenius) norm. 

For every dimension n, we introduce now a family of probability distributions on T>(C n ) which 
plays a central role in this paper. These probability measures are known as induced measures and 
can be described as follows. Fix an integer s and let \if))(i/j\ be a random pure state on the Hilbert 
space C n £g> C s , where ip is a random unit vector uniformly distributed on the sphere in C n <g> C s . 
Then consider the partial trace of I'^XV'I over the resulting state is a random state on C n and 
we denote by /i„ >s its distribution. 

When s ^ n, the probability measure has a density with respect to the Lebesgue measure 
on T)(C n ) which has a simple form 



< 6 > f^zh de "»" 



J II.. 



where Z UjS is a normalization factor. Note that formula allows to define the measure fi ntS (in 
particular) for every real s ^ n, while the partial trace construction makes sense only for integer 
values of s. 

In the important special case when s = n, the density of the measure /i n)n is constant. A random 
state distributed according to yU„ in is uniformly distributed on V(C n ) (i.e., uniformly with respect 
to the Lebesgue measure). 
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It is important to consider the case when the Hilbert space 7i itself carries a tensor product 
structure. For simplicity we will largely focus on the bipartite balanced case where % = C d <8> C d 
(multipartite Hilbert spaces are considered in Sections |5] and El while some remarks on extensions 
to the unbalanced setting % = C dl <g> C d2 , di ^ d 2 are given in Section [7^2]) . 

A quantum state on C d ® C d can be either separable or entangled, and this dichotomy is 
fundamental in quantum theory. By definition [15], a state p is separable if it can be written as 
a convex combination of product states (i.e. states of the form p\ ® p 2 , where pi, p% are states 
on C d ). If we denote by S(C d ® C d ) C V(C d ® C d ) the subset of separable states, an equivalent 
description is the following 

S(C d ®C d ) = conv{|^ 1 ®^2)(^i®^ 2 | : */>i,if>2 e C d , = |V 2 | = !}• 

A state which is not separable is called entangled. We denote n = d 2 the (complex) dimension 
of the space C d <S> C d , which we identify with C n . The affine hyperplane 

M™' 1 = {Ae : tr (A) = 1}, 

contains the set V = V(C n ) of states and its subset S = S(C n ) of separable states; they are both 
of full (real) dimension m = n 2 — 1 . 

We want to consider A^"' 1 as a vector space where the role of the origin is played by the 
maximally mixed state I/n, where I denotes the identity matrix. One way to formalize this point 
of view is to work with the linear hyperplane 

M s n a '° = {Ae M s n a : tr (A) = 0} = AC' 1 - I/n 

and with the translated convex body 

S = S -I/n = {p-I/n : p G S} C M s n a '°. 

Similarly, we denote T>q = T> — I/n (in the sequel, we will use analogous notation also for other 
sets). The geometry of the sets S and T> plays a central role in our argument. Estimates on some 
known geometric parameters associated to these convex bodies are gathered in Table HJ 

Table 1. Radii of T> and S for C d ® C d , where n = d 2 . All these parameters are 
translation invariant, so their values for T>q and So are respectively the same. In 
each row the quantities increase from left to right. 





inradius 


volume radius 


mean width 


outradius 


V(C n ) or V{C d g) C d ) 


= Wn(n-l) 






= y/(n - l)/n 


S{C d ® C d ) 


= l/y/n{n-\) 


~ n _3/4 


~ n ~ 3 / 4 


= V( n - l )l n 



The volume of D was computed exactly in [19] and it was noted in jlQ] that the mean width and 
the volume radius have the same order. The remarkable fact that T> and S have the same inradius 
was proved in [17] . (An alternative argument is based on a dual formulation given in jl2] , a proof 
of which was provided by H.-J. Sommers, see [31].) Sharp bounds on the volume radius of S were 
given in [3] (the ratio vrad(«S)/ vrad(P) is estimated in Theorem 1 in [3]). The estimate for the 
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mean width of S does not appear explicitly in [3] but follows from the argument, since the upper 
bound on the volume radius was obtained via Urysohn's inequality (j2D- Finally, the calculation of 
the outradii is easy: they are attained on pure states. Note that the inradii and the outradii of T> 
and S are attained on balls centered at I/n, which is the only point invariant under isometries of 
each of these bodies. 

It is easily checked that V (C n )° = —nV (C n ); this is a consequence of the fact that the cone 
of positive matrices is self-dual (cf. more general comments in the paragraph following ([TO])). We 
deduce immediately from Table [T]that the mean width of T>o(C n )° is of order yfn. The situation is 
not so simple for S, and estimating the mean width of iSo(C d ®C d )° is the main technical difficulty 
in our argument. 

The main result of the paper is the following theorem. 

Theorem 2.1. There are effectively computable absolute constants C, c > and a function Sq(oI) 
satisfying 

cd 3 ^ s (d) ^ Cd 3 \og 2 d 1 

such that if p is a random state on C d <S> C d distributed according to the measure pd 2 ,s, then, for 
any e > , 

(i) if s ^ (1 — e)so(d), we have 

P(p is separable) ^ 2 exp(— c(e)d 3 ), 

(ii) if s ^ (1 + e)s (d) we have 

P(p is entangled) ^ 2 exp(— c(e)s), 

where c{e) is a positive constant depending only on e. 

Theorem 12.11 asserts that, for fixed d, the character of the induced state changes sharply from 
generic entanglement to generic separability as s, the dimension of the ancilla, increases. If we 
knew that the threshold dimension so(d) was regular enough, it would immediately follow that the 
roles of d and s can be exchanged. The following is a result in that direction which can be deduced 
with relatively little effort. (A statement in a similar spirit — but much less precise — appears in 

Corollary 2.2. Consider a system of N identical particles (audits) in a random pure state. Then 
there is a threshold ko = ko(N) ~ N/5 such that two subsystems of k particles each typically share 
entanglement if k > k , and typically do not share entanglement if k < k . 

We next describe the threshold function So(d) appearing in the main theorem. Let G denote the 
standard Gaussian vector in the space A4™'°, which we will also call a GUE° random matrix. One 
may represent G as follows. Start from annxn GUE random matrix G', which is the standard 
Gaussian vector in the space .M™ (see [U [12] )• Then G can be realized as G' — -^-1 (a conditional 
expectation of G'). Equivalently, one may realize G by conditioning G' to be of trace 0. In this 
notation, sq is defined as follows. 
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Definition. For every integer d, we define so = so{d) by the formula 

(7) Md) := (^-jP*)' ~ -(5 °) 2 , 

where G denotes a GUE° matrix of size d 2 x d? . 

Our proof of Theorem 12.11 consists of two largely independent parts 

• proving that d 3 < s < d 3 log 2 d and 

• showing that sq is indeed a sharp threshold for the separability of random states, 

which will be dealt with separately in the next two sections (except for some fine points regarding 
the probability estimate in part (i), which are clarified in Section 17. 5p . The heuristics behind 
deducing Corollary 12.21 from Theorem 1 2 . 1 1 was explained in the Introduction. A rigorous argument 
requires two additional simple observations (Lemmas 14.31 and 17.31) and is sketched in Section 17.61 

3. Estimation of s 
This section is devoted to the proof of the inequalities 

(8) d 3 <s (d)<d 3 log 2 d, 

which comprise the first assertion of Theorem 12. 11 We introduce the convex body <S sym = — Sq C\Sq, 
a symmetrization of <So- We have 

Pk ym = max(||A|| 5o , || - A|| 5o ) < \\A\\ So + || - A\\ So , 

from which (because the distribution of G is symmetric) it follows that 

(9) E||G|| 5o ^E||G|| 5sym ^2E||G|| 5() , 

and so it is enough to establish bounds for E ||G||,s sym . 

Before embarking on that task we will make some elementary observations. First, the inradius of 
S sym equals the inradius of Sq, which (cf. Tabled]) is (n(n— l)) -1 / 2 > 1/n. On the other hand, any 
matrix A € Sq satisfies A ^ —1/n. This implies that any A £ <S sym satisfies —1/n ^ A ^ 1/n, or 
Halloo ^ l/fi, and therefore the outradius of <S sym is bounded by l/\/n. These estimates translate 
into the following inequalities, which we note for future reference 

(10) ™ 1/2 | • K II • \\s sym < n| • I ^ ^l-l^ll-H^^n- 1 / 2 !-!. 

It turns out that evaluating directly E ||G||5 or E ||G||5 S m is not easy. This may conceivably 
be related to the fact that computing ||A||s is an NP-hard problem [16]. Alternatively, we may 
note that E ||G||s is directly related (via <^j) to the mean width of Sq. Since there is a canonical 
link between duality of cones and duality of bases of cones (see [12], Lemma 1), it follows that any 
question about Sq is equivalent to a question about the cone of block-positive matrices and - via 
the Choi-Jamiolkowski isomorphism - to a question about the notoriously difficult to study cone 
of positivity-preserving maps on Aid, the algebra of d x d complex matrices (see [8], or sections II 
and III in [12] for details). 
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The approach which succeeds is to proceed through a duality argument. First, we estimate 
E||G||so m , which is an easier task. Then, we use a general theorem saying that for any "well- 
balanced" symmetric convex body one can deduce the average of the norm from the average of the 
dual norm, with a multiplicative error logarithmic in the dimension. 

Lemma 3.1. In the notation of the present section, we have 

E\\G\\s° ~n 1/4 . 

Lemma 3.2. In the notation of the present section, we have 

n 2 <E||G|| 5 ■EllGlUo <n 2 logn. 

r%j M l|0 S ym II 1 1 °sym f^J o 

Once both lemmas are proved, we infer that 

n 7 / 4 <E||G|| 5sym <n 7 / 4 logn, 

and the inequalities (JSJ) follow easily from formulae (JZJ) and §§§ (recall that n = d 2 , hence logn = 
21ogd). 

Proof of Lemma l3J[ The assertion of Lemma 13.11 is equivalent to (cf. d3J) and fl4])) 

w(S sym ) ~ n" 3/4 . 

We already know (up to universal multiplicative constants) the values of the volume radius and 
the mean width of So (cf. Tabled]). We have 

vrad(<S ) — w(S ) ~ n _3//4 . 

We also know that the volume is only mildly affected by symmetrization 

vrad(S sym ) ~ vrad(S ). 

This is the content of Proposition I A. 1 1 from Appendix [A] (to check that is the center of mass of 
So, average over local unitaries). We therefore have the following chain of inequalities (the first is 
trivial and the last is Urysohn's inequality (J2J)) 

w(S sym ) < w(S ) ~ vrad(So) ^ vrad(5 sym ) ^ w(S sym ). 

Therefore all these quantities are comparable, and Lemma 13.11 follows. □ 

Proof of Lemma \3.S[ The proof makes use of the ^-position of convex bodies, which is reviewed in 
Appendix [El Note that the first inequality in Lemma 13.21 is elementary, see the last paragraph in 
Appendix |Ej 

The upper inequality will follow from the MM*-estimate, which is valid for any symmetric 
convex body that is in the ^-position. If we knew that the convex body iS sym was in the ^-position, 
the result would be immediate from Proposition IE. 41 (applied with m = n 2 — 1, which implies 
logm < 21ogn). However, there are not enough symmetries present to conclude automatically 
that <S sym is in the -^-position. 
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We proceed as follows. Let E C -M* a, ° be the subspace spanned by the operators of the form 
o~i © cr 2 , where <7i and 02 are self-adjoint operators with trace on C d . Let F be the orthogonal 
complement of E in A^^ a '°. We then have 

F — {<7i (8) I : tr en = 0} © {I © cr 2 : tr a 2 = 0} =: F 1 © F 2 . 

Clearly dimi? = (n — l) 2 and dimF = 2n — 2. 

Let M : A'!* ' — > A'!*' 1 ' be a linear map such that w(iS sym ) is in the ^-position. By combining 
Lemma ID. II with Lemma IE.3I from the Appendices, we may assume that u has the form 

u = P E + (0 E ®v) 

for some (positive definite operator) v : F — >• F, where 0^ is the zero operator on E. The ideal 
property of the ^-norrn implies that 

is sy JP E ) = ts sy JuP E ) ^ £ Ssy Ju), 

and similarly for £s° ym (PE)- By the MM*-estimate, we know that £s Byin { u )£s° yrn { u ^ 1 ) ^ n 2 logn, 
and therefore £s sym (PE)£s° ym (PE) < n 2 \ogn (note that u" 1 = P E + (O^©^ -1 )). To deduce similar 
estimates for I in place of P E we need the following observation. 

Claim 3.3. £ Ssy jP F ) = o(£ Ssy Jl)) and £ s ° y jP F ) = o(^ ym (I)). 

Once the claim is proved, using the triangle inequality to bound £ Ssym (T) ^ £s sym (PE) + £s sym (PF) 
(and similarly for £s° m ), we obtain ^5 sym (1)^5° m (I) < n 2 logn, which is equivalent to the second 
inequality in the assertion of Lemma 13.21 □ 

Proof of Claim [PI We use the estimates on the inradius and the outradius of <S sym (see ([TO]) ) to 
deduce the following inequalities (recall that j m was defined in (j4j) 

£ Ssy jP F ) = E \\P E G\\ Ssym = w G ((S sym n F)°) < n ldimF < n 3 / 2 , 

% m (^) = E \\P F G\\ SSym = w G (P F S sym ) ^ n" 1 / 2 7dimF < 1. 

On the other hand, we have £sg ym (I) — nl ^ 4 (by Lemma 13. 1 p and £s sym (I) > n 7 / 4 (by the already 
shown lower estimate from Lemma 13. 2p . This proves Claim 13.31 and concludes the proof of Lemma 
I3.2[ and of the first assertion of Theorem 12.11 □ 

Remark. In the proof of Claim [X3|. we upper-bounded the mean widths of the convex bodies P F K 
and (KnF)° by their outradii. This is sufficient for the present argument, but these estimates are 



far from optimal. A more refined analysis is performed in Lemma \5.4\ it will be needed to handle 
the case of multipartite systems and unbalanced bipartite systems. 

4. Proof that s is a threshold for separability 

Let p = p n ^ s be a random state on C n with distribution p, n>s . The first step is to approximate 
p — l/n by ^^G, where G = G n is a n x n GUE° random matrix. Here is a precise statement (the 
proof is given in Appendix |C]) . 
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Proposition 4.1. Denote by c n>s (resp. C n)S ) the largest (resp. smallest) constant such that for 
every convex body K C -M* a, ° containing in its interior, if p is a random state on C n distributed 
according to p n>s , and if G is a standard Gaussian vector in Ai^ L, ° , we have 

I 

P ~ 



(11) 
Then 



n^fs 



E||G||jr<E 



lim Cn 



n 



K 



n-v/s 



lim c Ti 

n, — — >oo 



Remark. Let us clarify what double-indexed limits mean (here and later). The statement 

lim C U)S = 1 

n, — — >oo 

is equivalent to the following: for any sequences (nk), (s&) such that both (n^) and (s^/rife) tend to 



infinity, we have lim C n 



1. 



Remark. While the formulation of Proposition \J7I\ focuses on the regime when n and s/n tend 
to infinity, the proof can be adapted to other situations. For example, one can show that, for any 
a > 0, 

< inf Cn,s ^ sup C ntS < +oo. 

This allows to establish a threshold phenomenon even for properties - in place of separability - for 
which so(rf) — d 2 . However, in that case the argument does not yield the sharp threshold property, 



i.e., involving arbitrary e > 0. See Section 7.4 for more comments on related issues. 



Applying Proposition 14. 1 1 for K = Sq and using the definition <^ of so(d), we obtain that (when 
d and s/d 2 tend to infinity) 



(12) 



E 



I 

P 

n 



s 



s (d) 



Since a state p is separable when \\p — I/n\\ So ^ 1 and entangled when \\p — I/n||s > 1, this 
suggests that separability is typical when s > so(d) and entanglement is typical when s < so(d). 
This will be made rigorous through the next proposition; its proof is based on concentration of 
measure and postponed to Appendix [Fl 

Proposition 4.2. Let s ^ n, K C T>(C n ) be a convex body with inradius r, and p a random state 
with distribution p UjS . Let M be the median of \\p — I/n||# , with K Q = K — I/n. Then, for every 

T)>0, 

^ ?7 < exp(— as) + exp(— cn 2 sr 2 rj 2 ) 



P 



n 



M 



K 



We shall now show how to use Propositions 14.11 and 14.21 to conclude the proof of Theorem 12. 11 
For any integers d and s, we define TTd, s as the probability that a random state on C d ® C d with 
distribution p^ 2 s is separable. We first show that 7Td, s is decreasing with respect to d. 
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Lemma 4.3. Let s,d±,d2 be integers such that d\ ^ <i 2 . Then 

Proof. Identify C dl as a subspace of C d ' 2 , and let Q : C d2 —> C dl be the orthogonal projection. 
Then, C dl (g) C dl C C d2 Cg> C d2 is the range of the projection P = Q cg> Q. Let p 2 be a random state 
on C d2 Cg> C d2 with distribution p d 2 Then the state 

(13) pi ■ 



tr Pp 2 P 



is a random state on C dl £g>C dl with distribution p^ 2 ,s (this is obvious if we realize p 2 as GG^ /tr GG\ 
where G is a <i 2 x s random matrix with i.i.d. iVc(0, 1) entries). The inequality follows from the fact 
that, given the relation (|T3|) . the separability of p 2 implies the separability of pi (local operations 
cannot create entanglement). □ 

Remark. Another natural question is whether si ^ s 2 implies 7Td, Sl K ^d,s 2 > i- e - whether the 
probability that a random induced state is separable always increases with the dimension of the 
environment. We do not know the answer to this question (see Section \775\ ). 



Proof of Theorem \2.1\ We first address the part (i): fix e > 0, then for any s and d satisfying the 
condition s ^ (1 — e)so(d), we have to show that 

(14) 7r d , s ^ 2exp(-c(e)d 3 ). 
We start by establishing a slightly different estimate 

(15) ir^ s ^ 2exp(-c(e)s), 

which is stronger than (fl4|) in the crucial range s > d 3 . The case s = o(d 3 ) of (|T4|) can be then 
deduced formally using Lemma 14.31 and other known facts. We relegate the details to Section 17.51 
since the qualitative information provided by (|T4l) and (|T5|) is essentially the same; the only reason 
why we had not stated the Theorem with the bound (jT5l) is that it would have given a misleading 
impression that the genericity of entanglement wanes as s decreases, while in fact the opposite is 
true. 

In view of Lemma I4T31 to prove (TT5"j) it is enough to consider the case when, for a given s, d = d s 
is the minimal integer satisfying s ^ (1 — e)so(d). In particular, we have then 

lim s/d 2 = +oo 

s— >oo 

and we are in the asymptotic regime described in Proposition 14. 11 Let M s (resp. E s ) be the median 
(resp. the expectation) of ||p — I/d 2 !!^, when p is a random state on C d ® C d with distribution 
p d 2 s . Applying Proposition 14.11 we have, when s — > oo, 



E. > M - 1 
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We now apply Proposition 14.21 to the convex body K = So (its inradius is of order 1/n, cf. Table 
rjj. This gives, for any ij > 0, 

(16) P (| \\p - l/d 2 \\ So -M s \>r})< exp(-cs) + exp(-cV). 

Moreover, we have \M S — E s \ < d/yfs (cf. the remark following Lemma |F~3|) . Since d/y/s tends 
to 0, M s is also equivalent to 1/a/1 — s. We now choose rj such that 1/y/l — e — rj > 1, and we 
obtain that for s large enough 

7T d , s = P (||p - I/rf 2 )!^ < 1) < L7exp(-cs) + Cexp(-csr] 2 ) 

Hence there are constants C, c'(e) such that the inequality n^s ^ C exp(— c'(e)s) holds for every s 
(small values of s are taken into account by adjusting the constants if necessary). A priori we may 
have C > 2, but in that case the bound (IT5|) follows with c(e) = c / (e)/log 2 C. This shows part 
(i) of Theorem 12.11 except for the fine points related to the difference between the tail estimates 
(1141) and (1151) . which will be clarified in Section 17.51 The part (ii) is proved in the same way 
as Q15p . which combined with the arguments in the preceding section concludes the proof of the 
Theorem. □ 

5. Extension to the multipartite case 

In this section we consider the case of a multipartite system, on the Hilbert space H = (C d )® fc , 
and estimate the threshold for separability of random states in the asymptotic regime when k ^ 2 
is fixed and d tends to +oo. We denote n = d k = dim'H and m = n 2 — 1 = dimD('H). In this 
section, constants are allowed to depend on k; this is emphasized by writing Ofc(-) instead 

of <, ^, o(-). 

The set of separable states on % is the subset of T>(H) defined as 

S(H) = conv{|V>i ®---®^k){^x ® V»fc| : ipi,...,ipk e C d , |^i| = ••• = |^| = 1}. 

As in the bipartite case, our argument requires estimates on geometric parameters associated 
to 5(H), given in the next table. The statement about the inradius was proved in [18], and the 
statements about the mean width and the volume radius were obtained in [3]. 

Table 2. Radii of S for (C d )® fe , where n = d k . 





inradius 


volume radius 


mean width 


outradius 


S((C d )® k ) 


~ k rT 1 






= y/(n - l)/n 



The following is a multipartite version of Theorem 12.11 

Theorem 5.1. For every integer k ^ 2, there are effectively computable positive constants c k and 
Ck, depending only on k, and a function so(k,d) satisfying 

(17) c k d 2k ^ <: s (k, d) ^ C k d 2k - X log 2 d, 

such that if p is a random state on (C d )® fe distributed according to the measure Pd k ,s! then for 
every e > 0, 
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(i) Ifs ^ (1 — e)so(k, d), then 

P(p is separable) ^ 2 exp(— c(e)s). 

(ii) J/s ^ (1 + s)s (A;, d), i/ien 

P(p is entangled) ^ 2 exp(— c(e)s). 
i/ere c(e) a positive constant depending on e and on k. 

The proof of Theorem 15.11 is completely parallel to the bipartite case, except for one small 
point where a slightly finer analysis is required. We set Sq = So((C d )® k ) = iS((C d ) 0fe ) — I/n, 
Ssym = —So n So and define the threshold so(k, d) as 

E 1 1 C 1 1 ,S \ 2 „ ,/ O0 \2 



So (k,d):= ^ n dk n *° ) ~w(S° 

We first need to show that the estimates (ITT]) hold. We only sketch the proof. The multipartite 
version of Lemma 13.11 reads 

E||G||so ~fc n» 

II H°sym h 

and the proof, using the formula from Table [2] for the mean width of S, carries over verbatim. The 
multipartite version of Lemma 13.21 is 

Lemma 5.2. In the notation of the present section, we have 

n 2 <E\\G\\ Ssym -E\\G\\ s « yrn < k n 2 logn. 

Proof. We proceed as follows. Let E C Ai^ be the subspace spanned by the products of trace 
zero operators, and F be the orthogonal complement of E. We have dimE = (d 2 — l) k and 
dimF = d 2k — 1 — (d 2 — l) k ~ fc n 2_2 / fc . Let u : M s ^° —> M^ be a linear map such that u(S sjm ) 
is in the ^-position. By the results in Appendices [E] and O we may assume that u has the form 

u = P E + (0 E ®v) 

for some positive map v : F — >■ F . As in the bipartite case, the MM*-estimate and the ideal 
property of the £-norm imply that £s sym {PE)£s£ {Pe) i$ n 2 logn, and the lemma will follow from 
the following 

Claim 5.3. £ Ssyin (P F ) = o k (£ Ssym (I)) = o k (n^) and £s Sy jP F ) = o k (£s Sy Jl)) = o*(n£). 

The inradius of S syra equals the inradius of S and is of order n -1 , hence || ■ ||s n \ • I an d 

£s aym (P F ) = E \\P F G\\ Saym < k nE\P F G\ = n ldimF ~ fc n 2 ^, 

which implies the first part of the claim. The second part requires a little finer analysis. For 
1 ^ 3 ' ^ k, denote by Fj the following subspace of A4^ a, ° 

Fj = (M sa (C d ) <g> • • • <g> M sa (C d ) <g) RI <g> M sa (C d ) ® • • • <g> M sa (C d )) n AC' , 
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where the factor RI appears at position j. Since F C ® ■ Fj, it suffices to prove that ^s° ym (.PF 3 -) = 

o k (n^k) — it follows from the ideal property that £s° ym {Pv) ^ £s° ym {Pv) whenever V C V. By 
symmetry we may assume j = 1. We have 

(18) Zs Sy jP Fl ) = E \\P Fl G\\s Syin = w G (P Fl S sym ) < ^ G (P Fl 5 ). 

It turns out that the convex body P Fi Sq has a simple description. 
Lemma 5.4. For every k ^ 2, we have 

P Fl S ((c d f k ) = s ((c d f k ) n F 1 = I/d ® <S ((CT^) , 
w't/i £/ie notation x ® K = {x ®y : y G if}. 

Proof. The inclusions D are immediate. Conversely, let p G S ((C a! ) 0fc ). Starting from the separa- 
ble decomposition 

we obtain 

P Fl p = £ Ail/d ® pf } ® ■ • ■ ® pf } - = Vd ® {o- - I/d k ~ l ) , 
with aeS ((C^)®^" 1 )) defined as 

a = ^A,pf ) ®---®pf ) . 

It follows that P Fl S ({C d )® k ) = Pf,S ({C d )® k ) C I/d (g) 5 ((C d )^ k ^) . □ 

Remark. In the case k = 2, the convex body Sq ((C*)®^ -1 ') should be interpreted as P (C d ). 

From Lemma 15.41 it is very easy to compute wg{P Fi <So)- Since \\l/d\\ 2 = 1/Vd, the convex 
bodies P Fl So ((C d )® k ) and -^<$o ((C^)®^ -1 )) are congruent, hence have the same Gaussian mean 

width. Since w G (S ((C d )^ k ^)) Vd (see Table EJ, we obtain from (pj| that 

£ S o y jP Fl ) < w G {P Fl S ((CT k )) = -Lw G {S ((CT^)) ~ fc 1 

and the claim is proved. The rest of Theorem 15.11 follows by mimicking the proof in the bipartite 
case given in Section HI □ 

6. Generic entanglement: an alternative approach 

In this section we sketch an alternative proof that entanglement on C d ® C d is generic when s is 
sufficiently smaller than d 3 (Theorem 12. l( i)). The proof also yields a similar statement addressing 
two different regimes of the the multipartite case, including the one dealt with in Theorem 15. II i). 
In its present form, the argument does not show that separability is generic for larger s. However, 
we present it here since it leads to sharp estimates in the "small ball" regime (i.e., when the 
probability of separability is very small), and since it is much more straightforward. 

The proof is based on analyzing directly the density fl6]) of /i njS and specifically on the following 
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Lemma 6.1. Let s ^ n with logs <C n. Then there exists a universal constant C > 0, such that 
for any measurable subset K. C T> = T>(C n ), 



V n 



where m = n 2 — 1 = dimX>. 



Assuming the lemma and remembering that /i nj „ is just the Lebesgue measure (the Hilbert- 
Schmidt volume), it is easy to deduce (an estimate stronger than) the first assertion of Theorem 
12.11 from known results. Indeed, if n = d 2 and K, — S = S(C d ® C d ), we can read from Table fj] 



that 



Consequently, 



vol 5^™ vrad(iS) n 3 / 4 __i/4 



d 3 ) ' 

and so the probability of separability is very small if s < C^r?! 2 = C^d 3 . 

It is also straightforward to obtain in the same way estimates on the threshold value of s for 
multipartite entanglement (i.e., for all k ^ 2). Sharp bounds for ^2l±L j n the multipartite case were 
derived in [101 E]: 

Proposition 6.2. Let H = [Q d ^® k and let S(H) be the corresponding set of k -partite separable 
states. Then 



n 



volV(H) 



The two bounds in the Proposition reflect emphasis on two regimes, fixed k and large d (small 
number of large subsystems) and fixed d and large k (large number of small subsystems). When 
combined with Lemma 16.11 they lead immediately to the following 

Theorem 6.3. In the notation of Proposition let s ^ n = d k and let p be a random state on 
% distributed according to the measure fi nyS . Then P(p G S("H)) is (exponentially in m = n 2 — 1) 
small if s ^ c{k)n 2 ~ l l k or s ^ c(k)c'(d)n 2+ ^ d , where c(k) > (/clog k)~ l and c'(d) > d~ l . 

As the above bounds on the threshold values of s may appear not- very-transparent, we will make 
them explicit in some special cases. First, in the bipartite case k = 2 (two large subsystems), we 
again recover the bound of order r?l 2 = d 3 . In the tripartite case k = 3, entanglement is generic 
if s ^ cn 5 / 3 . If d = 2 and k is large (k qubits), the bound on the threshold value of s is (modulo 
factors of smaller order) n 2+/?2 , where fa ~ 0.18872 (we also have the equality n 2+l32 = 2 fc 3 3fc//4 , but 
the latter expression is again not-so-transparent). The numerical constants implicit in the >, < 
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notation in the Proposition and in the Theorem are effectively computable and can be recovered 
from the discussion following Theorems 1 and 2 in [3] and from the proof of Lemma 16.11 

Proof of Lemma \6. 1\ ( sketch). The argument follows the lines of [48] . where related questions 
were considered. Since, by the arithmetic-geometric mean inequality, det p ^ n~ n for p G V(C n ), 
formula ([6]) implies 

Hn,s{K) = ^- [ (det p) s ~ n dp ^-L [ n-< s ~ n Up = n-< s - n ^p n , n (1C). 

Hence the Lemma reduces to showing that ^ n - n ( s - n ) j m < ^J~E OT) equivalently (we may 
replace m = n 2 — 1 in the exponent by n 2 as long as logs <C n), 



(19) 



Z n ,n\ ^ - s x l 2 n s l n 



< 



Explicit formulae for Z UjS are known (see [29], |50l EH1 18]) 

' n ™ 

We point out that these quantities are sometimes referred to in the literature as "the a-volume" 
(with a = s — n + 1). Also, the normalization factors are often calculated for densities on the 
Weyl chamber {(Ai, ••• , X n ) G R n : Y17=i — 1; Ai ^ ■ • • ^ A n ^ 0} in the simplex of 
eigenvalues rather than for densities on the set of states; the two quantities differ by the factor 
^2^n{n-i)l2 J FJ™ =1 r(j), equal to the measure of the corresponding flag manifold. 

What remains is a tedious but routine calculation based on the Stirling formula, used here in 
the form lnT(x) = xlnx — x + O(lnx). We have 



s -lnv^ ~ 4 ( lnr(£;)-lnri 



n 2 




sn) 



(kink — k) — sn ln(sn) + sn 



(x In x — x) dx — — ln(sn) H — 

'k=s-n n n 

-lns-^-(^Hs-n)-^ s s 

ln(sn) + -. 

n z n n 



We now use the bound ln(s — n) = Ins + ln(l — -) ^ Ins — - to obtain, after simplifications, 

In Z nsS r— 1 s 1 
j> In v In in n Ins 

n 2 4 n 2 
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and hence, after exponentiating, 

(Z n , a )£ >n- s / n s- l l 2 . 
Similarly, but in a much simpler way, we are led to 

(This calculation was already performed in |40], where the equivalent formula vradI>(C n ) ~ 
g-i/4^-1/2 wag derived.) Combining the last two estimates we obtain f ll9p . □ 

7. Miscellaneous remarks and loose ends 

7.1. Threshold from zero to nonzero probability of separability. In this paper we esti- 
mated the threshold for separability in terms of the ancilla dimension, and show that the proba- 
bility of entanglement changes dramatically from nearly one to nearly zero around this threshold. 
A seemingly related question (but actually very different) is to ask for which ancilla dimensions 
the probability of separability is exactly zero. Here is a summary of our knowledge about this 
problem. (We do not claim originality.) 

Proposition 7.1. Let p be a random state on C d <g> C d distributed according to the probability 
measure Hd 2 ,s- 

(i) If s ^ d 2 , then < P(p is separable) < 1. 

(ii) If d = 2 and s = 3, then < P(p is separable) < 1. 

(iii) If s ^ (d — I) 2 , then P(p is separable) = 0. 

Proof, (i) follows from both S and T>\S having nonempty interior and from the density of pi n ^ s 
(E]) being strictly positive in the interior of T> for s ^ n. (ii) is slightly more delicate: here s = 3 
and n = d 2 = 4, and /i 4)3 is concentrated on the boundary of V. However, since the eigenvalues 
of MM t (for a 4 x 3 matrix M) are the same as those of M^M plus an additional 0, and since 
the distributions of MM^ and M^M are unitarily invariant, it follows that MM^ has a density 
with respect to the surface measure which is (modulo a normalization factor) det (M^M) , and in 
particular is nonzero on a set of full (surface) measure. On the other hand, it follows from [4T] that 
the probability of separability on the boundary of T> (with respect to the surface measure) equals 
!™n§ ^ (0, h), which combined with the preceding remark yields the conclusion. Finally, (iii) is a 
simple combination of Corollary 3.5 in jH], which asserts that a random subspace of dimension s 
in C d £g) C d almost surely contains no product vector when s ^ (d — l) 2 , and of [23], which points 
out that a separable state must have a product vector in its range. □ 

7.2. The unbalanced case: 7-L = C dl (g) C d2 , d\ ^ d 2 . A result analogous to Theorem 12.11 holds, 
with the threshold ^0(^1,^2) verifying 

cn min{di, d 2 } ^ so(di,d 2 ) ^ Cn(\ogn) 2 min{di, d 2 }, 

where n = d\d 2 . Showing this requires retracing the arguments from the balanced case. We may 
assume d 1 ^ d 2 . First, one checks that the argument from [3] yields w(S) = w(S ) — d 2 ' /n = 
(ndi) -1 / 2 (note that if we are only interested in the bipartite case, the < part follows rather easily 
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from Lemma 2 in [3]). The needed estimates for w(Sq) follow then word by word, except that the 
upper estimate requires a slightly more careful analysis which we detail now. If, as in the bipartite 
case, we denote iS sym = — <So H So, we have 

w G (S sym ) ~ \fd 2 . 

Let E, Ei and E 2 be the subspaces appearing in the proof of Lemma 13.21 The conclusion will 
follow if we prove that (for i — 1,2) 

(21) £ Ssym (P Fi ) < l^wOQ = lw G (S: jm ) ; £ SSy jP Fi ) < \is Sy JI) = lw G (S sym ). 

Indeed, since £ Seym (I) ^ £ Ssym (P F ) + £s Bym (P Fl ) + £s Bym (P F2 ), it follows from (EU) that £ Ssyin (I) ^ 
3£s sym (PE), and similarly for £s° m , and therefore 

w G (S sym )w G (S° sym ) = £ Ssy JI)£ s , y JI) ^ 9£ Say jP E )£ S o y jP E ) < n 2 \ogn 

by the MM*-estimate. 

Using Proposition IE. II and the unbalanced version of Lemma 15. 4| we obtain 

£s sym (P Fl ) = w G ((S sym n Fx) ) = w G ( 

where T> sym = —T> H T> . Similarly, 

£sg ym (P Fl ) = w G (P Fl S sym ) ^ w G (d^ 1/2 V(C d2 )) ~ v^M. 

Note that the equivalences w G (V° ym ) ~ w g (Pq) and WG(^'sym) — w G {V), and the values of these 
parameters, were determined in (JH]) and in Lemma 13.11 respectively. Analogous estimates hold for 
E 2 with the role of di and d 2 exchanged. We conclude that the conditions (12"T|) are satisfied unless 
d\ is smaller than some absolute constant. 

On the other hand, if 2 ^ min{d 1 ,d 2 } ^ C, then V C (C 2 - 1)«S (cf. [T7], Corollary 5; by 
looking at the Schmidt decomposition of a pure state it is easy to see that any bound valid with 
d\ — d 2 — C - the case considered in [17] - holds also when minjdi,^} ^ C). Accordingly, the 
widths of S, Sq are the same as those of T>, T>q "up to universal multiplicative constants" and the 
calculation becomes trivial (in this case, there is no log factor in the upper bound for So(di, d 2 )). 

7.3. Calculating the precise order of the mean width of S°, or is the log necessary. It 

is conceivable that the presence of logarithmic factors in Lemma 13.21 and hence in Theorems 12.11 
and I5.1| is due to the fact that we appeal to a result about general convex bodies (Proposition 
IE.4|) rather than calculating w(Sq) directly (or to the lack of precision in the upper bound from 
Proposition IE.4I) . Due to fundamental nature of the separability/entanglement dichotomy, the 
question about the precise order of w(Sq) is interesting by itself. It may be worthwhile to note 
that for the convex body L = V (C n ), the product w(L)w(L°) is majorized by a universal constant, 
even though the argument via ^-position necessarily leads to an upper bound which is > A/log n. 



d l 



1/2 



£'sym(C a!2 y 



' ' d\ d J , 
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7.4. Thresholds for other "standard" properties of quantum states. Our method can be 
generalized to estimate thresholds for other properties of random induced states (beyond sepa- 
rability), provided the set of states with this property is a convex subset K C T> and has some 
minimal invariance properties (such as being fixed under conjugation with local unitaries). In any 
such application one needs to estimate the mean width of K° , or at least the volume of K in order 
to adapt the argument from Section [6j 

One natural example of K is the set WT(C d ®C d ) of states with positive partial transpose (the 
volume radius and the mean width of which were estimated in [3] and shown to be much larger 
than those of S). Since WT = T> n T(V), where T is the partial transpose, it follows easily (cf. 
flHD) that w(VVT° ) < 2w(V° ) ~ d, whence (cf. ©) s ~ d 2 . However, this is less precise than the 
result from [2] where it was shown - using completely different techniques - that a sharp threshold 
for PPT occurs at s ~ Ad 2 . (While a more careful argument using concentration of measure shows 
that w(VVT° ) ~ 2d, in our approach we will always lose an additional multiplicative constant 
when using Proposition 14.11 Similarly, while it is likely that our argument based on majorization 
may be modified to show existence of a sharp threshold also in the regime Sq ~ d 2 , it is not at all 
clear how to retrieve this way the exact multiplicative factor 4.) 

A very interesting point is that there is a whole range of parameters, when the ancilla dimension 
s satisfies (for an arbitrary e > and an appropriate c > 0) 

{A + e)d 2 ^ s ^ cd 3 , 

where random induced states on C d £g> C d are - for large d - generically bound entangled, i.e. 
entangled and PPT (hence non-distillable). 

7.5. Improving the probability estimates, and sharpness of the threshold. Denote (as 
in Section S]) by ir^ s the probability that a random state on C d ® C d with distribution p d 2 s is 
separable. In Section H] we showed that, if s ^ (1 — e)s (d), then (TT5T) holds, i.e., 

7r djS ^ 2exp(-c(e)s). 

The above probability estimate is not optimal, both in its dependence on s and in the dependence 
on e that can be retrieved from the argument. The following conjecture sounds reasonable (the 
larger is the environment, the rarer is entanglement) and would formally imply the bound iTd,s ^ 
2 exp(— c(e)so(d)), which better agrees with physical heuristics and is formally stronger than both 
(|T5l) and the assertion of Theorem 12.11 (i). 

Conjecture 7.2. For any d ^ 2, the function s i— > 7td, s is non-increasing. 

While we do not prove this conjecture, the next simple lemma is a partial result in its direction. 
Lemma 7.3. For every d, s, we have the inequality 7r 2 d,s ^ ^d,is- 

Proof. Identify C 2d with C 2 <g> C d , and let r : V(C 2d <g> C 2d ) V(C d <g> C d ) be the partial trace 
over C 2 ® C 2 . Let p be a random state on C 2d £g> C 2d with distribution /i 4( ^ s . Then r(p) is a 
random state on C d ® C d with distribution /id2 4s , and 

p separable r(p) separable. 
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This shows the inequality ii2d,s ^ ftd,4s- D 

There are other ways to improve the probability estimate in various ranges. First, note that 
^d,s = if s ^ (d — l) 2 (see Section r7TT]) . Second, Theorem 16.31 implies that, for some absolute 
constants c, C\ > 0, itd, s ^ exp(— cd 4 ) whenever d 2 ^ s ^ Cid 3 . This establishes in particular the 
bound 7Td,s ^ 2 exp(— c(e)d 3 ) asserted in Theorem 12.11 (i), except for the narrow range (d — l) 2 < 
s < d 2 . This exceptional range can be handled as follows. Set d± = [(d+l)/2\; then ir^s ^ ftdi,s by 
Lemma [4.31 On the other hand, s > (d — l) 2 ^ d\ and (if d is sufficiently large) s < d 2 ^ cidf, so 
Theorem 16.31 applies and shows that n dltS < exp(— cdf) ^ exp(— cc? 4 /16). Combining the estimates 
we conclude that the bound ir d ^ < exp(— cd 4 ) extends to the entire range s ^ Cid 3 (perhaps with a 
different constant c). When combined with the argument from Section HI this completes the proof 
of Theorem 12.11 (i) as originally stated. 

Estimates for probabilities in Theorem 12.11 and similar statements translate directly into asser- 
tions about sharpness of the entanglement-separability threshold at s = s (d): the increase in the 
ancilla dimension s needed for the induced state to "flip" from generic entanglement to generic 
separability. As stated, Theorem 12.11 asserts that the increase is o(sq). Retracing the argument 
would allow to come up with an explicit (and clearly suboptimal) bound, apparently of the order 
of Sq for some 9 e (0, 1) (with 9 rather close to 1). On the other hand, finding precise order is 
likely a difficult "small ball" problem (cf. [27]). So, instead of pursuing such calculations, we 
sketch a heuristic argument which suggests the limits of our approach, which may be not far from 
the actual behavior. 

Our analysis shows that the sharpness of the threshold Sq depends on a combination of two effects: 
(i) the decay of E ||/>d 2 ,s — -||s as a functions of s and (ii) the concentration of \\Pd?, s ~ \ \\s around 
its mean (or median). We do know (from ( fl2l ) that the former is approximately 0(s) := y^f, 
and from Proposition 14.21 (see ffl6|) ) that \\pd?, s ~ ~||s ls concentrated on an interval whose length 
is of order choosing as r\ := where a is sufficiently large, makes the right hand side of 
( [16]) small. A simple calculation shows now that the increase in s needed to reduce the value of 
(j)(s) from 1 + 7] to 1 — r] is of order s 1 ^ 2 . The lack of rigor in this calculation stems from the 
approximation given by <p(s) not being known to be valid up to the precision of order t]. 

7.6. Regularity of the threshold function, and proof of Corollary 12.21 Theorem 12.11 is 
a statement about sharp transition from generic entanglement to generic separability as the di- 
mension of the ancilla s increases while the dimension n = d 2 of the system itself remains fixed. 
Clearly, this implies that a similar phenomenon takes place also with fixed s and variable d. How- 
ever, without any additional information about regularity of the threshold function so(d), one can 
not formally infer that — for example — a sharp transition occurs also in this new setting. 

While we do not have a complete picture of the regularity of so(d), or of the associated probabil- 
ities lids, we do have some information (Lemmas 14.31 and 17. 3 j) . which is enough to deduce Corollary 

m 

As noted already in the Introduction, the setting of Corollary 12.21 (i.e.. iV particles with D levels 
each and two subsystems of k particles each) is modeled by a random induced state on C d ®C d with 
d = D k and s = D N ~ 2k . Thus we need to show that the sequence pk '■= ir D k D N-2k has the following 
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property (i) for some small 5 > (which quantifies the "near 0" and "near 1" probabilities) and 
for some ko ~ N/5 (the threshold value of k), pk > 1 — 5 if k < k and pu < 5 if k > ko- Except 
for the value of ko, this is equivalent to the following (ii) if pk ^ 1 — 5, then pk+i < 5. We note 
that a slight generalization of Lemma T7.3[ with 2 replaced by an arbitrary D and 4 by D 2 , implies 
that the sequence (pk) is non- increasing; this is not necessary for our argument, but nevertheless 
reassuring. Also, if k is substantially smaller than N/5, then iV — 2k is substantially larger than 3k 
and so s = D N ~ 2k is substantially larger than D 3h = d 3 and, consequently — by Theorem 12.11 - 
Pk is close to 1. Similarly, if k is substantially larger than N/5, pk is close to 0. Accordingly, there 
is no doubt that the transition from p k ~ 1 to pk ~ does indeed occur as k increases, and that 
it occurs when k ~ N/5. The only point that needs to be made is the sharpness of the transition. 

To that end, note that — by Theorem 12.11 — a property similar to (i) and (ii) holds for the 
sequence := tt^d* for any fixed d: if D l ^ (1 — e)s (d), then tt^d* < 8 and if D % ^ (1 + e)so(d), 
then it^d* > 1 — <5 (as long as 5 > 2 exp(— c(e)s) for the appropriate values of s), and (1 —e)so(d) < 
D % < (1 + e)s (d) may happen at most for one value of i ~ log D (s (d)) ~ 31og£,<i (there will be 
no such i at all, unless log D (s (^)) is close to an integer). 

To show that the condition (ii) above is satisfied for the sequence (pk), we argue as follows. If 
Pk = 7r D fc £,jv-2fc ^1 — 5, then — by Lemma [T751 — also iT D k+i D N-2k ^1 — 6, and so the observation 
from the preceding paragraph applied with qi = ir D k+i t £)i implies that 7T£)fc+i i £)jv-2fe-i < 5 and, 
applied one more time (this works if 6 ^ 1 — 5, or if 5 ^ 1/2), that pk+i = 7i D k+i tD N-2k-2 < 5, as 
needed. 

Appendix A. Symmetpjzation mildly affects volume 

Some aspects of the theory of high-dimensional convex geometry require the hypothesis of sym- 
metry. For a general convex body, there are several options to associate to it a convex body which 
is symmetric. Usually volume and mean width are essentially unchanged by the symmetrization 
operation. Several famous results in this direction are due to Rogers-Shephard (cf. [32]); they 
additionally assert that the worst case occurs when the body is a simplex. 

We use the following inequality which is a variation on the Rogers-Shephard inequality : 

Proposition A.l. If K C R m is any convex body with center of mass at the origin, then 

vo\(-K n K) > 2~ m vo\{K). 

A proof can be found in [30] (Corollary 3). The factor 2~ m is not likely to be sharp; it seems to 
be unknown whether the simplex is the extremal case. 

Appendix B. Majorization and oo-Wasserstein distance 

We gather here some facts concerning the usual modes of convergence from probability, oo- 
Wasserstein distance, and the concept of majorization. They will be used in the proof of Propo- 
sition 14.11 in the next appendix, but they are fairly general and independent of that particular 
application; we believe that stating them separately may be of reference value. 
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Definition. Let /ii, \i 2 be 'probability measures on R. The oo -Wasserstein distance is defined as 

daoifii, H 2 ) := inf \\X 1 - X 2 ||l<», 

with infimum over all couples (Xi,X 2 ) of random variables with (marginal) laws [i\ and fi 2 , de- 
fined on a common probability space. Similarly, if Yi,Y 2 are random variables, we will mean by 
^00(^1,^2) the oo-Wasserstein distance between the laws ofY\ and Y 2 . 

Note the following inequality: whenever / : R — > R is a L-Lipschitz function and X, Y are 
bounded random variables, we have 

(22) \Ef(X)-Ef(Y)\^Ld 00 (X,Y). 

The oo-Wasserstein distance can be computed from cumulative distribution functions: if Fx{t) = 
P(X ^ t), then 

doo{X, Y) = inf{e > : F x (t - e) ^ F Y (t) ^F x (t + e) for all t G R}. 

The following lemma is elementary and can be proved by using the fact that the Levy distance 
metrizes the weak convergence for probability measures (see e.g. [15], Section 4.3). 

Lemma B.l. Let Z be a random variable distributed according to a measure fiz, with support 
equal to some bounded interval [a,b]. If (Y n ) is a sequence of random variables, the following are 
equivalent: 

(1) dniYn, Z) -+0, 

(2) Y n — > Z weakly and supF n — > b, mfY n — > a. □ 

Note that the hypothesis on the support is crucial: the equivalence fails if the support is not 
connected. 

Next, we will relate the oo-Wasserstein distance to the concept of majorization, usually defined 
for vectors in TV 1 . Given x G R n we will denote by the non-increasing rearrangement of x. If 
x, y G R n '° (the hyperplane of sum vectors in R n ), we write x -< y if , for every k G {1, . . . , n}, 
we have 

k k 

8=1 8=1 

(note that for k = n we always have equality). The following is well-known (see [9], Section II). 

Proposition B.2. For x,y G R n '° the following are equivalent. 

(1) x -< y, 

(2) whenever is a permutation-invariant convex function on H n, °, then <fr(x) ^ 4>{y), 

(3) For every t G R ; we have Y17=i \ Xi ~ A ^ SI=i \Vi ~ ^1; 

(4) x can be written as a convex combination of coordinatewise permutations of y. 



By inf and sup we really mean here essential inf and sup. 
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We will need a quantitative formulation of the concept of majorization. If x, x' G R n, ° with 
x' ^ 0, we denote by 5(x, x') the smallest positive constant c such that x -< cx'. In other words, 
S(-,x') is the gauge associated to the convex body obtained as the convex hull of permutations of 
x'. We have the inequality 5(x, x") ^ 5(x, x')5(x', x"). The quantity 5(x, x') should not be thought 
of as a distance, but rather as the norm of a certain operator. 

We can rephrase the concept of majorization in the language of "n-point empirical measures," 
i.e., the probability measures of the form v x = n -1 X^fc=i $x k - The restriction requiring that x G 
R n, ° translates into the underlying random variable having zero mean. Note that (by Proposition 

El 



5(x,y) = 5(y,x) = 1 y is a coordinatewise permutation of X \ )' V X Uy. 

The following key observation connects majorization and convergence in oo-Wasserstein distance. 

Lemma B.3. Let Z be a non-constant bounded random variable with mean and with distribution 
Hz- Then, for every e > 0, there exists r) > (depending on Z and e) such that for every n and 
for all vectors x, y G R n '° satisfying doo^, jiz) ^ t] and d^^Uy, ftz) ^ f], we have 5(x, y) ^ 1 + e. 

Proof. Denote a = inf Z and b = sup Z (the hypotheses on Z imply a < < b). Let e > 0, 
and x,y G R n '° such that d^^, Hz) ^ V and d^Uy.^z) ^ V- We must choose r\ to ensure that 
S(x, y) ^ 1 + e or, equivalently, x -< (1 + £)y. By Proposition IB. 21 this is equivalent to 

n n 

^2\xi-t\ ^ 5Z|(1 + £)yi-t\ foralltGR 

i=l i=l 

(where x = (xj) . = and y = (jjij ) or, in the language of n-point empirical measures, 
(23) f (t) := J \u - t\ dv x {u) «C J \(l + e )u-t\ du y {u) =: g (t). 

We first note that these conditions are automatically satisfied if t ^ maxjXj. Indeed, we have then 
\u — t\ = t — u for all u in the support of v x and so the first integral above equals J tdv x — f udv x = 
t — = t. On the other hand, we always have | (1 + £)u — t\ ^ t — (1 + £)u and so the second integral 
is at least J tdv y — J (1 + £)udv y — t — (1 + e)0 = t. Similar argument applies when t ^ min^ x,. 

If we choose rj ^ |min(— a,b), then we have (1 + e/2)a ^ min^Xj ^ maXjXj ^ (1 + £/2)b and 
therefore it suffices to show the inequality /o ^ 9o on the interval ((1 + e/2)a, (1 + e/2)6). To that 
end, we compare the functions /o an d go with the functions 

f(t) = B\Z -t\ = J \u-t\dfi z (u), 

g(t) =E\(1 + £)Z -t\ = J \{l + £)u-t\dfi z (u). 
Lemma IB .31 will now follow from the following lemma. 

Lemma B.4. In the above notation, f(t) < g(t) for t G ((1 + £)a, (1 + £)b) . 
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Indeed, since / and g are continuous, there is a number 9 > such that f(t) < g{t) — 9 for 
every t G [(1 + e/2)a, (1 + e/2)b\. On the other hand, by {22}, we have |/ (t) - f(t)\ ^ 77 and 
\go{t) — g(t)\ ^ (1 + e)r] for any t G R. It remains to choose 77 such that (2 + 5)77 ^ 9 to guarantee 
that /o < 9o on [(1 + e/2)a, (1 + e/2)&] . □ 



Proof of Lemma B.4 Assume first t ^ 0. We have 



g(t)-f(t) = J (|(1 + s)u - t\ - \u - t\) d/i z (u) = J (|(1 + e)u - t\ - \u - t\ + eu) d^ z {u). 

It is now elementary to check that the last integrand is if u ^ t/(l + e) and strictly positive 
if u > t/(l + e). Accordingly, the integral is always nonnegative, and it is strictly positive if 
the interval (t/(l + e),oo) intersects the support of fiz, that is exactly when t/{l + e) < b, or 
t < (1 + e)b. The case t ^ is handled similarly (or by a change of variable u = —v). □ 



Appendix C. GUE approximation to induced states: Proposition 14.11 

The strategy of the proof is as follows. First, we gather known facts from Random Matrix 
Theory which assert that, for the regime in question (i.e., n, - — > 00), the appropriately normalized 
ensembles (G n ) and (p n s — -) converge (in probability) to the same limit in the sense of non- 
commutative probability. Then we will show that - in the same regime - the expectations of the 
gauges || • \\k must be asymptotically the same for both ensembles, which is essentially the assertion 
of Proposition 14.1} this part of the argument will be based on material from Appendix [B] 

We first set some notation. Recall that M. 13 ® is the space of self-adjoint operators on C n and 
j\yfsafi the subspace of self-adjoint trace operators. We denote by sp(A) G R n the spectrum of 
an operator A G -M™ (ordered in the increasing order for definiteness, but this is irrelevant). Note 
that A G M s n a >° sp(A) G R n>0 . Conversely, if x G R n , we denote by Diag(x) G M s £ the 

diagonal matrix built from x. 

Recall that the standard semicircular distribution (or Wigner distribution) is the probability 
distribution /i sc with density 

^-V4 - X 2 1\ X \^ 2 - 

Note that a random variable Z with semicircular distribution satisfies the hypotheses of Lemmas 
m and EH 

We are now in a position to state the needed facts from Random Matrix Theory. Recall that the 
n-point empirical measure associated to a vector x = (xk) G R n is the measure v x = n~ x Y^k=i <^ fe - 

Proposition C.l. For every n, let G n be an n x n GUE random matrix, and let ^ sp („-i/2 Gn ) be 
the rescaled empirical spectral distribution. When n tends to infinity, the sequence (^ sp ( n -i/2G n )) 
converges to fi sc , in probability, with respect to the 00- Wasserstein distance. 

Convergence in probability means that for any e > 0, lim^oo P(d oc (fi n , /i sc ) > e) — 0. 
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Proposition C.2. For every n, s, let A n ^ s = p n>s —-, where p„ iS is a random state with distribution 
H n)S , and let t / sp ( v /^4 n s) be the rescaled empirical spectral distribution. When n and s/n tend to in- 
finity, the sequence (^ S p(y/nsA n a) ) converges to jj, sc , in probability, with respect to the oo-Wasserstein 
distance. 

References for Propositions I C. l\ and \C.B. By Lemma lB.ll convergence to the semicircle distribu- 
tion with respect to the oo-Wasserstein distance is equivalent to weak convergence and convergence 
of extreme eigenvalues. These statements appear separately in random matrix literature. 

Wigner's famous result (see [T]) is that t / sp ( n -i/2 Gn ) converges weakly, in probability, towards /i sc . 
The standard setting is the case of GUE matrices, while here we consider GUE° matrices (i.e., 
GUE matrices conditioned to have trace 0), but it is easily checked that this doesn't affect the 
limit distribution. 

The statement about convergence of limit eigenvalues is also well-known. For example, it is 
known that E ||Gr re ||ao ^ < 2\fn (a proof for GUE matrices can be found in Appendix H of |40| . and 
it extends to GUE via Jensen's inequality). Concentration of measure (e.g., the Gaussian version 
of Lemma IF. 1|) implies then that 

(24) P(||Gf n /v^||oo > 2 + t) ^ exp(-nt 2 /2). 

For Proposition IC.2t we refer to [6], where the weak convergence towards the semicircular distri- 
bution is proved (one should point out that this statement appeared implicitly already in [28]). The 
statement about limit eigenvalues, as well as tail inequalities analogous to ff2"4"|) . can be deduced 
from the techniques from [19] (see also [H], Theorem 2.7). □ 

For every integer n, denote by e R n, ° the "ideally semicircular" vector, i.e. the vector 
X- = (X-,...,X-J such that 

where F sc is the cumulative distribution function of standard semicircular distribution (note that 
the sum of coordinates is indeed zero). Obviously, the sequence (vx™) converges to jj sc in the 
oo-Wasserstein distance. 

Let K be a convex body in Ai^'° (containing in the interior), with || • ||x the corresponding 
gauge function. Define a gauge <fi on R n, ° by setting 

<f)(x) = [ \\UDiag(x)rf\\ K dU, 

JU(n) 

where the integral is taken with respect to the (normalized) Haar measure on the unitary group. 
Let G n be an n x n GUE random matrix, and let U E U(n) be a Haar-distributed random 
unitary matrix independent from G n . By unitary invariance, G n has the same distribution as 
Ir Diag(sp(G n ))£7. Therefore, we have 

E^(sp(G f n ))=E||G re || K . 

Since <fi is convex and invariant under permutation of coordinates, we have <f)(x) ^ 5(x,y)<ft(y) 
for every x, y G R n, °. In particular, if we introduce the random variables a n = 5(sp(-^G n ), X^) 
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and p n = <5pQ c ,sp(^G n )), we have 

< 0(sp(G„)) < v^WH- 

Taking expectation, we obtain 

(25) vWW) E/3" 1 ^ E ||G„|| X < yfaKX?) Ea n . 

Let A n)S = p n ^ s — -, where p n>s is a random state with distribution fi n s . Introduce the random 
variables a' ns = 5(sp(y/nsA n:S ), X^) and f3' ns = 5(X^ c ,sp(y/nsA njS )). Since the distribution of 
A UjS is unitarily invariant, the same argument applies verbatim, and yields the inequalities 

(26) 4=^OT) EGfliJ- 1 < E ||p n , s --\\ K < -^^K c ) E<,. 



By combining ( 1251) and ( 1261) . one obtains the following inequalities for the constants C n>s and 
c niS introduced in Proposition 14.11 



E(t)- 1 . e«: 



It is now immediate to deduce Proposition 14.11 from the following lemma. 
Lemma C.3. In the notation introduced above, we have 

lim Ea n = lim E/3" 1 = lim Ea ns = lim E(^ a ) -1 = l- 

n— >oo n— >oo n,s/n— >oo ' n,s/n—>oo ' 

Proof of Lemma \C. 31 Together with Propositions IC.ll and lC.2[ Lemma fB . 31 implies that a n , /3 n , a' n s 
and (3' n s converge in probability to 1 when n and s/n tend to infinity. 

The convergence of Ea n will follow if we show that, say, sup n Ea^ < +oo. Introduce a vector 
z G R n '° with \n/2\ coordinates equal to 1 and [n/2j coordinates equal to —1 (if n is odd, the 
remaining coordinate is necessarily 0). It is easily checked that 5(z, X^ c ) is bounded by an absolute 
constant C. Moreover, for any x G R n '°, we have 8(x, z) = ||x||oo- It follows that 

in 



and the problem is reduced to showing that sup n E HG^/x/tTH^, < +oo, which follows easily from 
the tail estimates fT24"]) . 

Since a n f3 n ^ 1, convergence of E/3" 1 follows from the convergence of Ea„. The convergence of 
Ea! ns and E(/3^ s ) _1 is proved along the same lines. □ 

Remark. The above arguments are quite flexible and become particularly simple if we do not aim 
at showing that S(-, •) ~ 1, but at an estimate 8(-, •) < 1. For example, the estimates stated in the 
second remark following Proposition 4-1 can be deduced from the known bounds on E || ■ (such 
as (|24|) ) for the ensembles in question and the following elementary fact: if x,y G R n '° are such 
that max,,- \xj\ ^ ^ ^ - \yj\, then x -< 2Cy. 
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Appendix D. Irreducible subspaces for the isometry group of S 

Fix an integer d > 1. Denote by H the (real) vector space Ai s d a , Hq = _M^ a '° the hyperplane of 
trace zero matrices, and Hi = Hq the one- dimensional space of scalar matrices. (Note that this 
notation is not identical to the one used in the main body of the paper.) 

The unitary group U(C d ) acts on H by conjugacy: the action of an element U G U(C d ) is given 

by 

A ^ UAU j . 

Similarly, for any integer k, the direct product U(C d ) k acts on H® k : the action of a fc-tuple 
(Z7i, . . . , U k ) being given by 

(27) A H- (U x ® ■ ■ ■ ® U k )A{U\ ®---®U\) 

(this construction is called "external tensor product" in representation theory). 

When a group Y acts on a (real or complex) vector space E (i.e., if C(E) is the space of linear 
operators on E and n : T — > C(E) is a representation), one says that a nonzero subspace E C E 
is invariant if, for every 7 G r, vr(7)F C F. We say that a nonzero subspace F C E is irreducible 
if, for every nonzero x E F, 

F = span{7r(7)a; : 7 G T}. 

Lemma D.l. Consider the action of U(C d ) k on H® k given by ([27]) . A subspace E C i/ ^ 
irreducible if and only if it has the form 

(28) H n ®---® H ik 

for some choice of (ij, . . . , i^) G {0, l} fc . Moreover, a subspace E C invariant if and only if 

it is the direct sum of subspaces of the form (128j) . 

Before we prove Lemma [D.ll let us state basic results from representation theory (see [37]). 

Proposition D.2. The following results are valid for complex representations of compact groups. 

(i) For every representation ir : T — > C(E), there exists a decomposition of E as direct sum 

(29) E = Q)E a , 

a 

where (E a ) are irreducible subspaces. 

(ii) //, moreover, there does not exist indices a 7^ a' such that the subrepresentations of n into 
C(E a ) and C(E a >) are isomorphic, then the decomposition in ( 1291) is unique. 

(hi) If 7Ti : Ti — t- C(Ei) and 7r2 : — ?• ^(-£^2) two irreducible representations , then the 
(external) tensor product representation 

7Tl ® vr 2 : Ti x T 2 ® E 2 ) 

zs irreducible. 

The statements (i), (ii) and (iii) appear in [37] as Theorems 2, 8 and 10 respectively (note that 
although these theorems are stated in [37] for finite groups, they remain valid for compact groups, 
as noted in Chapter 4). 
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Proof of Lemma \D.1\ Let us switch to the complex field: denote by E c the complexification of a 
real vector space E. Note that H c naturally identifies with the space of complex d x d matrices, 
and (H® k ) c naturally identifies with (H c ) m . 

We first check that the subspaces Hq and are irreducible for the action of U(C d ) on H. 
Obviously is irreducible. Let A G Hq with A ^ and consider the (complex) space 

J 77 = span{U j AU : U G U{C d )}. 

We must show that J 7 = H^. Note that J 7 necessarily contains a matrix with the diagonal non 
identically zero (since one may diagonalize either the Hermitian or the ant i- Hermit ian part of 
A, and one of them is nonzero). Averaging over all diagonal unitary matrices shows then that 
J 7 contains a nonzero diagonal matrix. Since the symmetric group &d acts irreducibly on the 
hyperplane of sum zero vectors in C d , we deduce that A contains every diagonal trace zero matrix, 
and therefore every Hermitian or skew-Hermitian trace zero matrix, so that J 7 = Hq. 

By Proposition ID.2( iii). it follows that for every k G N and every (ii, ...,«&) G {0, l} k , the 
subspace 

is an irreducible subspace in (_£/~ c )® fc for the action of U(C d ) h . Moreover, the 2 k corresponding 
subrepresentations are pairwise non-isomorphic (indeed, for every 1 ^ j ^ k, the index ij can 
be retrieved by looking at the subgroup corresponding to the jth copy of U{C d ): its action on 
Eu x i k ) is trivial iff ij = 1). By Proposition ID.2( ii). this is the unique decomposition of (_£/" c )® fc 
into irreducible subspaces, and any invariant subspace is obtained as the direct sum of some of 
these irreducible subspaces. 

Note that a subspace E C H® k is invariant if and only if its complexification E c C (if c )® fc is 
invariant. This implies the second part of Lemma [D. 11 The first part follows since the irreducible 
subspaces are minimal in the lattice of invariant subspaces. □ 

Appendix E. The ^-position 

Important: In this section we restrict our attention to symmetric convex bodies. While most 
of the concepts and results generalize to the non-symmetric case, it is not known whether the 
central result, Proposition IE.44 holds in that setting. 

Our presentation of ^-position is standard and follows closely the book |34j . Let K C R m be a 
symmetric convex body, and let T : R m — > R m be a linear operator. We define the quantity £r{T) 
as 

£ K (T)=E\\T(G)\\ K , 

where G denotes the standard Gaussian vector in R m (i.e. the coordinates are independent N(0, 1) 
random variables). If there is no ambiguity about the underlying convex body, we write I instead 
of Ik- The following proposition collects elementary properties of this concept. 

Proposition E.l. If K C R m is a symmetric convex body, then 

(i) £k(-) is a norm on C(H m ), 

(ii) £k-(TiT 2 ) ^ ^(Ti)x||2~2||op and £k(TiT 2 ) ^ £f t -(T 2 )||Xi|| op (this is called the ideal property) , 
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(iii) £ K (l) = w G (K°) = lm w{K°) ~ y/mw(K°) and £ K {u) = £ u -iM, 

(iv) If Pe denotes the orthogonal projection on a subspace E C R m , then 

£k{Pe) = w G ((K n E)°) = w G (P E K°), 

where by (K (1 E)° we mean the polar inside E. 

Before proceeding, let us point out that the more common definition of the £-norm is via the second 
moment, namely (E ||T(G) H^-) 1 ' 2 . However, the two expressions are equivalent. 

Proposition E.2. For any symmetric convex body K C R m and for any linear operator T : R m — >■ 
R m ; we have 

E\\T(G)\\ K < (E \\T(G)\\ 2 K ) l/2 < ^E\\T(G)\\ K . 

Proposition IE. 21 (which we do not really need here, but include for clarity) is a special case of 
Corollary 3 in [23] . If we do not insist on obtaining the optimal constant 7r/2, the result is more 
elementary (essentially a special case of the so-called Khinchine-Kahane inequality) and extends 
to non-symmetric convex bodies (see, e.g., [7], Lemma 3.3). 

We now consider the maximization problem 

max detT. 

By compactness, the maximum is attained and it is obviously strictly positive. Since, for V G 0(m) 
and for any T, we have £{T) = £(TV), it follows that it is enough to restrict our attention to T's 
that are positive definite (PD). We claim that, under the PD restriction, the solution T is unique. 
Indeed, if we had any other solution T 1; it would follow that T = (T + 7i)/2 verifies, on the 
one hand, £(T) ^ 1 and, on the other hand, detT > (detT ) 1 / 2 (det Ti) 1 / 2 = detT (by strict 
log-concavity of det over PD), a contradiction. Clearly, £(T ) = 1. 

If the maximum above is attained when T is a multiple of identity, one says that K is in the 
^-position. For every symmetric convex body K, there is a linear transformation T such that TK 
is in the ^-position; moreover T is unique up to rotations and homotheties. 

We will take advantage of the uniqueness of the ^-position through the following lemma 

Lemma E.3. Let K C R m be a symmetric convex body and V be the isometry group of K (i.e. 
the set of orthogonal transformations U such that U(K) = K). 

Then there is a linear map T such that T(K) is in the l-position and which has the form 

i 

for some Aj > and some subspaces (Ei) which are invariant under the action ofT. 

Proof. Let T ^ be the unique solution to the maximization problem (|30|) . For any 7 G V 
and x G R m , we have ||7(Tx)||a: = ||T:e||k-, hence £{^T) = £(T). Since £{TV) = £(T) for any 



(30) 
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orthogonal transformation V, it follows that £(7X7 l ) = i(T). Uniqueness of the solution implies 
that 7T7- 1 = T. Write 

T = ^A i P i) 

i 

where Aj > are distinct positive numbers and Pi pairwise orthogonal projectors. Given i, we 
have 7_P;7~ 1 = Pj for all 7 G T, in particular the range of Pj is invariant under the action of T. □ 

We will use the following theorem which compares the mean width of K and the mean width of 
K° whenever K is a convex body in the ^-position. 

Proposition E.4. Let K C R m be a symmetric convex body in the l-position. Then 

£ K (I) i K °(T)<mlogm, 

or, equivalently, 

w(K) w(K°) < logm. 

This result is known in the Asymptotic Geometric Analysis as the "MM*-estimate" . It follows 
by combining results of Figiel and Tomczak- Jaegermann with those of Pisier, including in particular 
sharp bounds on the so-called "i^-convexity constant." See chapter 3 in [M] for a complete proof 
and references. 

Incidentally, while O(logm) is the optimal general upper bound for the i^-convexity constant 
mentioned above (see [10]), to the best of our knowledge it is not known whether it gives the 
correct order in Proposition IE. 41 The pair gives an example for which w(K)w(K°) is 

of order A/log m. In the non-symmetric case, the ^-dimensional simplex A is a example with 
w (A)w(A°) ~ logm, but known general upper bounds for non-symmetric convex bodies are much 
weaker [7J [36] . 

Note that the lower bound wg{K)wg{K°) > m (or w(K)w(K°) ^ 1) follows simply from 
7m = E |G| < E (\\G\\f ||G||^ 2 ) ^ (E \\G\\ K ) 1 ' 2 (E \\G\\ KO ) 1/2 = vrj^fif) 1 / 2 

Appendix F. The concentration argument 

We first recall the statement of Levy's lemma. In the statements of Lemmas IF. II and IF.2[ P 
stands for the uniform measure on the sphere S m ~ l , normalized so that P(5 ,m_1 ) = 1. 

Lemma F.l (Levy's lemma, [26]). If f : S*" 1 " 1 — > R is an L-Lipschitz function, then for every 
e > 0, 

P({\f - M\ > e}) ^ L7 1 exp(-c 1 me 2 /L 2 ), 
where M is any central value of f , and C±,ci > are absolute constants. 

By a central value of a random variable X we mean either the expectation or the median, or 
more generally any number M such that P(X ^ M) ^ 1/4 and P(X ^ M) ^ 1/4. Any two 
central values for the function / appearing in Levy's lemma differ by at most C^L/v^- 

We will also consider situations in which a function / has a (possibly) large Lipschitz constant, 
while the restriction of / to a large subset has a small Lipschitz constant. The following version 
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of Levy's lemma handles such a case. The trick behind this lemma appeared in [I] and implicitly 
in W\. 



Lemma F.2 (Levy's lemma, local version). Let Q C S m 1 be a subset of measure larger than 3/4. 
Let f : S" 1 ' 1 — > R be a function such that the restriction of f to Q is L-Lipschitz. Then, for every 
e>0, 

P({|/(ar) - M f \ > e}) ^ P(5 m " 1 \ Q) + Ci exp(- Cl me 2 /L 2 ), 
where Mf is the median of f , and Ci,Cx > are absolute constants. 

In Lemma IF.21 the median can be replaced by another quantile (up to changes in the numerical 
constant). However, in general, it cannot be replaced by the mean (we do not assume any regularity 
of / outside Q, therefore the expectation may even fail to be well-defined). Still, more often than 
not, some information about global regularity of / is available and concentration around the mean 
can be inferred; see the comment at the end of the proof or the remark following Lemma \F. 3 1 below. 

Proof. The key point is that in any metric space X, it is possible to extend any L-Lipschitz function 
h defined on a subset Y without increasing the Lipschitz constant. Use, e.g., the formula 

h{x) = inf [h(y) + L dist(x, y)} . 

yeY 

By applying this fact to X = S 1 ™ -1 , Y = Q and h = /|q, we obtain a function / : S" 1 ^ 1 — > R 
which is L-Lipschitz and coincides with / on Q. Moreover, if M = Mf is the median of /, then 

P({/~ ^ M}) > P(fi n {/ > M}) = p(n n {/ ^ M}) ^ 1/4. 

Similarly, P({/ M}) ^ 1/4. Hence M is a central value for /. By Levy's lemma, 

P({\f-M\ >e}) ^C 1 exp(-c 1 m£ 2 /^ 2 )- 

Therefore, 

P({|/-M|>£» < P({/^/})+P({|/-M|>e}) 

< P (^-i \Q) + d exp(- Cl me 2 /L 2 ), 

as claimed. Note that if we know, for example, that / is "reasonably bounded," then we can 
infer that the median M and the mean E/ of / do not differ very much (e.g., \M — E/| < 
||/ — M||oo P(S ,m ~ 1 \ Q)) and deduce a posteriori concentration around the mean. □ 

We are now in position to prove Proposition 14.21 

Proof of Proposition \4 ■ %\ Let p be a random state on C n with distribution By definition, p 
has the same distribution as 

where ip is uniformly distributed on the unit sphere in C n ® C s . Equivalently, p has the same 
distribution as AA\ where A is an n x s matrix uniformly distributed on the Hilbert-Schmidt 
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sphere Shs (h can be identified with the real sphere S 2ns 1 ). Consider the function / : Shs — > R- 
be the function defined by 

(31) f(A) 



n 



K 



Lemma F.3. For every t > 0, denote by Q t the subset 

n t = {AeS HS : Halloo < *}■ 
Then the Lipschitz constant of the restriction of f to Q t is bounded by It jr. 

Remark. In particular, taking t — 1, one obtains that the global Lipschitz constant of f is bounded 
by 2/r. This implies that any two central values for f differ by at most C/(ry/ns). 

Proof. The function / is the composition of several operations: 

• the map A i— > \\A\\k, which is 1/r-Lipschitz with respect to the Hilbert-Schmidt norm. 

• the map A y A — I/n, which is an isometry for the Hilbert-Schmidt norm, 

• the map A h-> AA* , whose Lipschitz constant can be estimated by the following chain of 
inequalities 

\\AA ] -BB ] \\ 2 <: ||A(A t -£t) + (A - B)B% 
(32) ^ WAIUIA^-B^h + WA-BUB 1 '^ 

< (||^||oo + ||^||oo)||^- ^Ih- 

In particular, HA, Be fi, we obtain \\AA* - BB*\\ 2 ^ 2t\\A - B\\ 2 . □ 

We now apply Lemma IF.21 and we obtain that for every rj > 0, 

P(|/ - M| ^ n) < P(S HS \ 0*) + exp(- Cl ns V 2 (2t/ry 2 ). 

If we choose t = 3/y/n, then P(Shs \ ^t) ^ exp(— cs) (this follows from a net argument as 
explained in [1]) and the result follows. □ 
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